Name | Version | Summary | date |
---|---|---|---|
msclap | 1.3.3 | CLAP (Contrastive Language-Audio Pretraining) is a model that learns acoustic concepts from natural language supervision and enables “Zero-Shot” inference. The model has been extensively evaluated in 26 audio downstream tasks achieving SoTA in several of them including classification, retrieval, and captioning. | 2023-10-20 21:18:51 |
hour | day | week | total |
---|---|---|---|
20 | 1604 | 10863 | 266051 |